Enhancing Hotelling's T2 Statistic using Shrinkage Covariance Matrix for Identifying Differentially Expressed Gene Sets

نویسندگان

  • Suryaefiza Karjanto
  • Rasimah Aripin
  • Norazan Mohamed Ramli
  • Nor Azura Md Ghani
چکیده

The breakthrough of microarray technology is a vital research instrument to measure the quantitative and highly parallel of gene expression. In microarray studies, it is common that the data set typically consists of tens of thousands of genes (variables) from just dozens of samples due to various constraints including the high cost of producing microarray chips. As a result, the combined sample covariance matrix in Hotelling’s T statistic is not invertible. Therefore the distribution of the resulting statistic is either unknown or insufficient. In this study, shrinkage covariance matrix is proposed to improve Hotelling’s T statistic for identification of differentially expressed gene sets. The use of shrinkage covariance matrix overcomes the non-singularity problem in the estimation of sample covariance matrix when the number of variables is larger than the number of samples. The performance of the proposed method was measured using simulation study. The result implies that this approach outperforms existing techniques in many conditions tested.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of univariate and multivariate gene set analysis in acute lymphoblastic leukemia.

BACKGROUND Gene set analysis (GSA) incorporates biological with statistical knowledge to identify gene sets which are differentially expressed that between two or more phenotypes. MATERIALS AND METHODS In this paper gene sets differentially expressed between acute lymphoblastic leukaemia (ALL) with BCR-ABL and those with no observed cytogenetic abnormalities were determined by GSA methods. Th...

متن کامل

Statistical methods of translating microarray data into clinically relevant diagnostic information in colorectal cancer

MOTIVATION It is a common practice in cancer microarray experiments that a normal tissue is collected from the same individual from whom the tumor tissue was taken. The indirect design is usually adopted for the experiment that uses a common reference RNA hybridized both to normal and tumor tissues. However, it is often the case that the test material is not large enough for the experimenter to...

متن کامل

Shrinkage-based diagonal Hotelling's tests for high-dimensional small sample size data

High-throughput expression profiling techniques bring novel tools and also statistical challenges to genetic research. In addition to detecting differentially expressed genes, testing the significance of gene sets or pathway analysis has been recognized as an equally important problem. Owing to the ‘‘large p small n’’ paradigm, the traditional Hotelling’s T 2 test suffers from the singularity p...

متن کامل

Unite and conquer: univariate and multivariate approaches for finding differentially expressed gene sets

MOTIVATION Recently, many univariate and several multivariate approaches have been suggested for testing differential expression of gene sets between different phenotypes. However, despite a wealth of literature studying their performance on simulated and real biological data, still there is a need to quantify their relative performance when they are testing different null hypotheses. RESULTS...

متن کامل

Accurate ranking of differentially expressed genes by a distribution-free shrinkage approach.

High-dimensional case-control analysis is encountered in many different settings in genomics. In order to rank genes accordingly, many different scores have been proposed, ranging from ad hoc modifications of the ordinary t statistic to complicated hierarchical Bayesian models. Here, we introduce the "shrinkage t" statistic that is based on a novel and model-free shrinkage estimate of the varia...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014